Linking across forms in vertical scaling under the common-item nonequivalent groups design
Abstract
This dissertation compares the proficiency estimates produced by two scale maintenance approaches, the horizontal and the vertical, in supporting scale stability across grades within a form, within a grade across forms, and across both grades and forms. It thereby investigates under which within-grade variability patterns and examinee sample characteristics one approach is preferable to the other. Because the literature offers no universally accepted growth model, three sets of simulated proficiency distributions were specified that differ in their within-grade variability pattern: constant variability across grades, decreasing variability as grade increases, and increasing variability as grade increases. Two sets of examinee sample characteristics were also specified: small examinee group differences and large examinee group differences. Crossing the three variability patterns with the two group-difference levels gave six proficiency distribution conditions for data generation. The study focuses on how the two scale maintenance approaches perform, under these six conditions, in terms of the resulting proficiency estimates across multiple forms. One major implication is that the accuracy in recovering true examinee proficiencies on the new form across multiple linkings appeared to depend on an interaction among the number of forms or years involved in the linking process, the within-grade variability pattern, and the examinee group differences, but did not appear to be a function of the maintenance approach implemented.
Related resources
Investigating Common-Item Screening Procedures in Developing a Vertical Scale
Creating a vertical scale involves several decisions on assessment designs and statistical analyses to determine the most appropriate vertical scale. This research study aims at investigating common item stability check procedures to arrive at vertical linking item sets that will produce the necessary constants for computing vertical theta (ability) estimates and scale scores on a vertical scal...
Investigating Content and Construct Representation of a Common-item Design When Creating a Vertically Scaled Test
According to the equating guidelines, a set of common items should be a mini version of the total test in terms of content and statistical representation (Kolen & Brennan, 2004). Differences between vertical scaling and equating would suggest that these guidelines may not apply to vertical scaling in the same way that they apply to equating. This study investigated how well the guideline of con...
Growth Scales as an Alternative to Vertical Scales - Practical Assessment, Research & Evaluation
Student growth models depend on comparing assessments of individual students over time. Vertical scales (cf. Kolen and Brennan, 2004) are among several options that exist for development of scales that allow these comparisons. Briefly, vertical scales are created through administering an embedded subset of items to different students at two educational levels, typically one year apart, and lin...
Vertical Scaling in Standards-Based Educational Assessment and Accountability Systems
This paper describes vertical scaling as a method for linking a set of test forms of increasing difficulty, and explores the applicability of these methods for standards-based educational achievement tests under status-based and growth-based accountability priorities. The purpose of the paper is to inform state policy-makers and assessment and accountability specialists about vertical scaling m...
Separate Versus Concurrent Estimation of IRT Item Parameters in the Common Item Equating Design
Hanson, Bradley A.; Beguin, Anton A. Separate versus Concurrent Estimation of IRT Item Parameters in the Common Item Equating Design. ACT Research Report Series, ACT-RR-99-8. American College Testing Program, Iowa City, IA, December 1999. 36 pp. Descriptors: *Equated Scores; Estimation (Mathematics); *It...